Improved speech recognition word lattice translation by confidence measure
نویسندگان
چکیده
In conventional speech translation systems, Automatic Speech Recognition (ASR) produces a single hypothesis which is then translated by the SMT system. The translation results of SMT system are impaired by the word errors of the first best hypothesis in this approach more or less. To improve speech translation, we use a new word lattice translation approach which integrates multiple information sources from the speech recognition word lattice to discount the misrecognition. Furthermore, in order to improve speech translation and to reduce computation, we used N-bests cutoff, merging of identical word ids, and confidence measure. Experiments of Japanese-to-English speech translation showed that the proposed word lattice translation outperforms the conventional single best method.
منابع مشابه
A decoding algorithm for word lattice translation in speech translation
We propose a novel statistical machine translation decoding algorithm for speech translation to improve speech translation quality. The algorithm can translate the speech recognition word lattice, where more hypotheses are utilized to bypass the misrecognized single-best hypotheses. We also show that a speech recognition confidence measure, implemented by posterior probability, is effective to ...
متن کاملIntegration of speech recognition and machine translation: Speech recognition word lattice translation
An important issue in speech translation is to minimize the negative effect of speech recognition errors on machine translation. We propose a novel statistical machine translation decoding algorithm for speech translation to improve speech translation quality. The algorithm can translate the speech recognition word lattice, where more hypotheses are utilized to bypass the misrecognized single-b...
متن کاملImproving Speech-to-Speech Translation Using Word Posterior Probabilities
Nowadays, speech translation is a research problem in machine translation. The problem arises as to how to combine speech recognition and machine translation in a suitable way. Some authors have shown that the speech translation can be improved by using word lattices as input of the translation system. The acoustic recognition scores from the word lattice are used for improving the translation ...
متن کاملExploiting deep neural networks for detection-based speech recognition
In recent years deep neural networks (DNNs) – multilayer perceptrons (MLPs) with many hidden layers – have been successfully applied to several speech tasks, i.e., phoneme recognition, out of vocabulary word detection, confidence measure, etc. In this paper, we show that DNNs can be used to boost the classification accuracy of basic speech units, such as phonetic attributes (phonological featur...
متن کاملUsing Word Lattice Information for a Tighter Coupling in Speech Translation Systems
In this paper we present first experiments towards a tighter coupling between Automatic Speech Recognition (ASR) and Statistical Machine Translation (SMT) to improve the overall performance of our speech translation system. In coventional speech translation systems, the recognizer outputs a single hypothesis which is then translated by the SMT system. This approach has the limitation of being l...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005